SMIL: Multimodal Learning with Severely Missing Modality
Authors
Abstract
A common assumption in multimodal learning is the completeness of training data, i.e., full modalities are available in all training examples. Although there exists research endeavor in developing novel methods to tackle the incompleteness of testing data, e.g., modalities are partially missing in testing examples, few of them can handle incomplete training modalities. The problem becomes even more challenging if considering the case of severely missing modality, e.g., ninety percent of training examples may have incomplete modalities. For the first time in the literature, this paper formally studies multimodal learning with missing modality in terms of flexibility (missing modalities in training, testing, or both) and efficiency (most training data have incomplete modality). Technically, we propose a new method named SMIL that leverages Bayesian meta-learning in uniformly achieving both objectives. To validate our idea, we conduct a series of experiments on three popular benchmarks: MM-IMDb, CMU-MOSI, and avMNIST. The results prove the state-of-the-art performance of SMIL over existing methods and generative baselines including autoencoders and generative adversarial networks.
Similar resources
Latent Low-Rank Transfer Subspace Learning for Missing Modality Recognition
We consider an interesting problem in this paper that uses transfer learning in two directions to compensate missing knowledge from the target domain. Transfer learning tends to be exploited as a powerful tool that mitigates the discrepancy between different databases used for knowledge transfer. It can also be used for knowledge transfer between different modalities within one database. Howeve...
Towards SMIL as a foundation for multimodal, multimedia applications
Rich and interactive multimedia applications, where audio, video, graphics and text are precisely synchronized under timing constraints are becoming ubiquitous. Multimodal applications further extend the concept of user interaction combining different modalities, like speech recognition, speech synthesis and gestures. However, authoring dialog-capable multimodal, multimedia services is a very d...
Learning with Missing Features
We introduce new online and batch algorithms that are robust to data with missing features, a situation that arises in many practical applications. In the online setup, we allow for the comparison hypothesis to change as a function of the subset of features that is observed on any given round, extending the standard setting where the comparison hypothesis is fixed throughout. In the batch setup...
Modality Convergence in a Multimodal Dialogue System
When designing multimodal dialogue systems allowing speech as well as graphical operations, it is important to understand not only how people make use of the different modalities in their utterances, but also how the system might influence a user’s choice of modality by its own behavior. This paper describes an experiment in which subjects interacted with two versions of a simulated multimodal ...
Modality Theory: Supporting Multimodal Interface Design
Modality theory addresses the following general problem of mapping task domain information into interactive multimodal interfaces: given any particular set of information which needs to be exchanged between user and system during task performance in context, identify the input/output modalities which constitute an optimal solution to the representation and exchange of that information. This pap...
Journal
Journal title: Proceedings of the ... AAAI Conference on Artificial Intelligence
Year: 2021
ISSN: 2159-5399, 2374-3468
DOI: https://doi.org/10.1609/aaai.v35i3.16330